Search results for "cluster [track data analysis]"
showing 10 items of 1171 documents
Global emergence of the widespread Pseudomonas aeruginosa ST235 clone
2018
Objectives: Despite the non-clonal epidemic population structure of Pseudomonas aeruginosa, several multi-locus sequence types are distributed worldwide and are frequently associated with epidemics where multidrug resistance confounds treatment. ST235 is the most prevalent of these widespread clones. In this study we aimed to understand the origin of ST235 and the molecular basis for its success. Methods: The genomes of 79 P. aeruginosa ST235 isolates collected worldwide over a 27-year period were examined. A phylogenetic network was built, using a Bayesian approach to find the Most Recent Common Ancestor, and we identified antibiotic resistance determinants and ST235-specific genes…
Toward a direct and scalable identification of reduced models for categorical processes.
2017
The applicability of many computational approaches hinges on the identification of reduced models defined on a small set of collective variables (colvars). A methodology for scalable probability-preserving identification of reduced models and colvars directly from the data is derived, without relying on the availability of the full relation matrices at any stage of the resulting algorithm, allowing for a robust quantification of reduced model uncertainty and allowing us to impose a priori available physical information. We show two applications of the methodology: (i) to obtain a reduced dynamical model for polypeptide dynamics in water and (ii) to identify diagnostic rules from a standar…
A clustering package for nucleotide sequences using Laplacian Eigenmaps and Gaussian Mixture Model.
2018
In this article, a new Python package for nucleotide sequence clustering is proposed. This package, freely available on-line, implements a Laplacian eigenmap embedding and a Gaussian Mixture Model for DNA clustering. It takes nucleotide sequences as input, and produces the optimal number of clusters along with a relevant visualization. Although we did not optimise the computational speed, our method still performs reasonably well in practice. Our focus was mainly on data analytics and accuracy and, as a result, our approach outperforms the state of the art, even in the case of divergent sequences. Furthermore, an a priori knowledge on the number of clust…
Dissection of DLBCL microenvironment provides a gene expression-based predictor of survival applicable to formalin-fixed paraffin-embedded tissue
2018
Background: Gene expression profiling (GEP) studies recognized a prognostic role for tumor microenvironment (TME) in diffuse large B-cell lymphoma (DLBCL), but the routine adoption of prognostic stromal signatures remains limited. Patients and methods: Here, we applied the computational method CIBERSORT to generate a 1028-gene matrix incorporating signatures of 17 immune and stromal cytotypes. Then, we carried out a deconvolution on publicly available GEP data of 482 untreated DLBCLs to reveal associations between clinical outcomes and proportions of putative tumor-infiltrating cell types. Forty-five genes related to peculiar prognostic cytotypes were selected and their expression …
Pharmacogenomics of Scopoletin in Tumor Cells
2016
Drug resistance and the severe side effects of chemotherapy necessitate the development of novel anticancer drugs. Natural products are a valuable source for drug development. Scopoletin is a coumarin compound, which can be found in several Artemisia species and other plant genera. Microarray-based RNA expression profiling of the NCI cell line panel showed that cellular response to scopoletin did not correlate with the expression of ATP-binding cassette (ABC) transporters as classical drug resistance mechanisms (ABCB1, ABCB5, ABCC1, ABCG2). This was also true for the expression of the oncogene EGFR and the mutational status of the tumor suppressor gene, TP53. However, mutations in the RAS onc…
Patterns of Eating and Physical Activity Attitudes and Behaviors in Relation to Body Mass Index
2016
The aim of the study was to identify and characterize patterns of psychological and behavioral characteristics in relation to body mass index. In addition, the study examined the associations between the patterns and demographic characteristics, exercise, eating habits, and health-related psychological variables. Participants were 361 randomly selected Greek adults who completed self-reported questionnaires. The surveys examined demographic characteristics, health-related psychological variables (attitudes and intentions toward exercise and healthy eating, perceived behavioral control, health locus of control, general health, self-control, and body image) and the behaviors of exerci…
MicroRNA as crucial regulators of gene expression in estradiol-treated human endothelial cells.
2018
Background/Aims: Estrogen signalling plays an important role in vascular biology as it modulates vasoactive and metabolic pathways in endothelial cells. Growing evidence has also established microRNA (miRNA) as key regulators of endothelial function. Nonetheless, the role of estrogen regulation on miRNA profile in endothelial cells is poorly understood. In this study, we aimed to determine how estrogen modulates miRNA profile in human endothelial cells and to explore the role of the different estrogen receptors (ERα, ERβ and GPER) in the regulation of miRNA expression by estrogen. Methods: We used miRNA microarrays to determine global miRNA expression in human umbilical vein endothelial cel…
FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases.
2016
The accelerated growth of protein databases offers great possibilities for the study of protein function using sequence similarity and conservation. However, the huge number of sequences deposited in these databases requires new ways of analyzing and organizing the data. It is necessary to group the many very similar sequences, creating clusters with automated derived annotations useful to understand their function, evolution, and level of experimental evidence. We developed an algorithm called FastaHerder2, which can cluster any protein database, putting together very similar protein sequences based on near-full-length similarity and/or high threshold of sequence identity. We compressed 50…
Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.
2016
Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are, however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine str…
Snapshots of a shrinking partner: Genome reduction in Serratia symbiotica
2016
Genome reduction is pervasive among maternally-inherited endosymbiotic organisms, from bacteriocyte- to gut-associated ones. This genome erosion is a step-wise process in which once free-living organisms evolve to become obligate associates, thereby losing non-essential or redundant genes/functions. Serratia symbiotica (Gammaproteobacteria), a secondary endosymbiont present in many aphids (Hemiptera: Aphididae), displays various characteristics that make it a good model organism for studying genome reduction. While some strains are of facultative nature, others have established co-obligate associations with their respective aphid host and its primary endosymbiont (Buchnera). Further…